From fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions.

نویسندگان

  • B Zhang
  • L Rychlewski
  • K Pawłowski
  • J S Fetrow
  • J Skolnick
  • A Godzik
چکیده

A database of functional sites for proteins with known structures, SITE, is constructed and used in conjunction with a simple pattern matching program SiteMatch to evaluate possible function conservation in a recently constructed database of fold predictions for Escherichia coli proteins (Rychlewski L et al., 1999, Protein Sci 8:614-624). In this and other prediction databases, fold predictions are based on algorithms that can recognize weak sequence similarities and putatively assign new proteins into already characterized protein families. It is not clear whether such sequence similarities arise from distant homologies or general similarity of physicochemical features along the sequence. Leaving aside the important question of nature of relations within fold superfamilies, it is possible to assess possible function conservation by looking at the pattern of conservation of crucial functional residues. SITE consists of a multilevel function description based on structure annotations and structure analyses. In particular, active site residues, ligand binding residues, and patterns of hydrophobic residues on the protein surface are used to describe different functional features. SiteMatch, a simple pattern matching program, is designed to check the conservation of residues involved in protein activity in alignments generated by any alignment method. Here, this procedure is used to study conservation of functional features in alignments between protein sequences from the E. coli genome and their optimal structural templates. The optimal templates were identified and alignments taken from the database of genomic structural predictions was described in a previous publication (Rychlewski L et al., 1999, Protein Sci 8:614-624). An automated assessment of function conservation is used to analyze the relation between fold and function similarity for a large number of fold predictions. For instance, it is shown that identifying low significance predictions with a high level of functional residue conservations can be used to extend the prediction sensitivity for fold prediction methods. Over 100 new fold/function predictions in this class were obtained in the E. coli genome. At the same time, about 30% of our previous fold predictions are not confirmed as function predictions, further highlighting the problem of function divergence in fold superfamilies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Functional Investigation of the Novel BRCA1variant (Glu1661Gly) byComputationalTools andYeastTranscription Activation Assay

Introduction: Mutations in the BRCA1 gene are major risk factors for breast and ovarian cancers. However, the relationship between some BRCA1 mutations and cancer risk remains largely unknown. Cancer risk predictions could be improved by evaluation of the impairment degree in the BRCA1 functions due to a specific mutation. This study aimed to assess the functional effect of a novel variant (Glu...

متن کامل

Functional Investigation of the Novel BRCA1variant (Glu1661Gly) byComputationalTools andYeastTranscription Activation Assay

Introduction: Mutations in the BRCA1 gene are major risk factors for breast and ovarian cancers. However, the relationship between some BRCA1 mutations and cancer risk remains largely unknown. Cancer risk predictions could be improved by evaluation of the impairment degree in the BRCA1 functions due to a specific mutation. This study aimed to assess the functional effect of a novel variant (Glu...

متن کامل

Energy Conservation Potential of the Heat Pipe Heat Exchangers: Experimental Study and Predictions

The energy conservation potential of the heat pipe based heat exchangers (HPHXs) was studied in this research. To this end, a typical climate chamber as the representative of an air conditioning system was established. The performance characteristic of a typical eight-row HPHX was obtained based on the one week operation (168 h) to determine the performance characteristic curves. The coil face ...

متن کامل

dbNSFP: A Lightweight Database of Human Nonsynonymous SNPs and Their Functional Predictions

With the advance of sequencing technologies, whole exome sequencing has increasingly been used to identify mutations that cause human diseases, especially rare Mendelian diseases. Among the analysis steps, functional prediction (of being deleterious) plays an important role in filtering or prioritizing nonsynonymous SNP (NS) for further analysis. Unfortunately, different prediction algorithms u...

متن کامل

طراحی واکسن مبتنی بر کامپیوتر: گزارش کوتاه

Background: Although the conventional vaccines have been instrumented in the incidence of many infectious diseases, the advances in genetic engineering and bioinformatics have provided the opportunity for developing improved and new vaccines.Methods: Reverse vaccinology was pioneered by a group of researchers investigating development of a vaccine against serogroup B Neisseria meningitidis. Rev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Protein science : a publication of the Protein Society

دوره 8 5  شماره 

صفحات  -

تاریخ انتشار 1999